Hughes Trainable Text Skimmer: MUC-3 test results and analysis
Abstract
Figure 1 gives the official results for the Hughes Trainable Text Skimmer used for MUC-3 (TTS-MUC3). TTS is a largely statistical system, using a K-Nearest Neighbor classifier with the output of a shallow parser as features. (See the System Summary section of this volume for a detailed description of TTS-MUC3.) The performance on a slot-by-slot basis is, therefore, what one might expect: the pure set fills such as "Incident Type" and "Category" have much better performance than the string fills such as "Human Target." In addition, we can see that "Incident Date" and "Incident Location," for which special code was written, have performance above that of the string fills.
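As a rough illustration of why set fills suit this kind of classifier, the following minimal sketch assigns an "Incident Type" label by majority vote among the nearest stored cases. It is not the TTS-MUC3 implementation: the feature sets, the case data, the Jaccard similarity measure, and the names `jaccard`, `knn_set_fill`, and `CASES` are all assumptions made for the example.

```python
from collections import Counter


def jaccard(a, b):
    """Overlap between two feature sets, in the range [0, 1]."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def knn_set_fill(query_features, cases, k=3):
    """Return the majority label among the k stored cases most similar to the query.

    cases is a list of (features, label) pairs. The features stand in for
    shallow-parser output; their form here is hypothetical.
    """
    ranked = sorted(cases, key=lambda c: jaccard(query_features, c[0]), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical training cases: phrase-level features -> "Incident Type" set fill.
CASES = [
    ({"bomb", "explode", "building"}, "BOMBING"),
    ({"bomb", "car", "explode"}, "BOMBING"),
    ({"kidnap", "mayor"}, "KIDNAPPING"),
    ({"abduct", "kidnap", "businessman"}, "KIDNAPPING"),
    ({"shoot", "kill", "judge"}, "MURDER"),
]

print(knn_set_fill({"car", "bomb", "embassy"}, CASES, k=3))
```

A set fill has a small closed vocabulary of answers, so a vote over neighbors maps directly onto it. A string fill such as "Human Target" must recover an open-ended span of text, which a label vote alone cannot produce.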
Related resources
Hughes Research Laboratories: description of the Trainable Text Skimmer used for MUC-4
The objective of the Hughes Trainable Text Skimmer (TTS) Project is to create text skimming software that: (1) can be easily re-configured for new applications, (2) improves its performance with use, and (3) is fast enough to process several megabytes of text per day. The TTS-MUC4 system is our second full-scale prototype. It is an adaptation of the TTS-MUC3 system [1] [2], which constitute...
Hughes Trainable Text Skimmer: description of the TTS system as used for MUC-3
TTS-MUC3 incorporates semi-automated lexicon generation and almost fully automated phrase pattern generation. Associative retrieval from a case memory provides raw data for computing set fills and string fills. TTS-MUC3's modular process model integrates the results of case memory retrieval over sentences from multiple stories, extracts the date and location of incidents, and computes cros...
Tools and techniques for rapid porting
Charlie Dolan, from Hughes Research Laboratories, discussed some of the difficulties in using trainable components in an information extraction system. The UMass/Hughes system used six different trainable components in their MUC5 system; portability between the EJV and EME domains was achieved partly through retraining these components. One of these components, the Trainable Template Generato...
Unisys: MUC-3 test results and analysis
[A Linguistic Analysis Component.] Although a natural language processing component was included in the design of the Unisys MUC-3 system as a third level of text analysis, not enough time was available during the MUC-3 development cycle both to develop a knowledge-based information retrieval component and to port the Unisys Pundit text-processing system to the MUC-3 terrorist domain. A dec...